Automatically deriving structured knowledge bases from on-line dictionaries1
نویسندگان
چکیده
keywords: computational lexicography; lexical knowledge bases We describe an automated strategy which exploits on-line dictionaries to construct a richly-structured lexical knowledge base. In particular, we show how the Longman Dictionary of Contemporary English (LDOCE) can be used to build a directed graph which captures semantic associations between words. The result is a huge and highly interconnected network of words linked by arcs labeled with semantic relations such as Hypernym, Part_of, Location, and Purpose. We argue that this knowledge base provides much more detailed information about word meanings than can be obtained using standard lexical lookup procedures or by relying on statistical measures of semantic associations among words. 1We would like thank the other members of the Microsoft Natural Language group: Joseph Pentheroudakis, Karen Jensen, George Heidorn, and Diana Peterson.
منابع مشابه
Improving Clinical Diagnosis Inference through Integration of Structured and Unstructured Knowledge
This paper presents a novel approach to the task of automatically inferring the most probable diagnosis from a given clinical narrative. Structured Knowledge Bases (KBs) can be useful for such complex tasks but not sufficient. Hence, we leverage a vast amount of unstructured free text to integrate with structured KBs. The key innovative ideas include building a concept graph from both structure...
متن کاملA DL Semantics for Reasoning over OVM-based Variability Models
Software Product Line (SPL) development has traditionally included Variability Management as a way of defining, modelling, implementing and testing variability. In this context, we have created a framework, SeVaTax, based on extensions of the Orthogonal Variability Model (OVM), and aimed at analysing properties of variability models and deriving products from an SPL. Despite several approaches ...
متن کاملLearning to Extract Relations from MEDLINE
Information in text form remains a greatly underutilized resource in biomedical applications. We have begun a research effort aimed at learning routines for automatically mapping information from biomedical text sources, such as MEDLINE, into structured representations, such as knowledge bases. We describe our application, two learning methods that we have applied to this task, and our initial ...
متن کاملAutomatic Discovery of Preservation Alternatives Supported by Community Maintained Knowledge Bases
Preservation Planning, which deals with selecting the most appropriate preservation action to be applied to digital objects, is an important step in any digital preservation activity. Comprehensive Preservation Planning depends on the availability of identified alternatives of preservation actions, which are for example file format migrations to migrate data in an outdated format to one that ha...
متن کاملLearning Knowledge Bases for Information Extraction from Multiple Text Based Web Sites
We describe a learning approach to automatically building knowledge bases for information extraction from multiple text based web pages. A frame based representation is introduced to represent domain knowledge as knowledge unit frames. A frame learning algorithm is developed to automatically learn knowledge unit frames from training examples. Some training examples can be obtained by automatica...
متن کامل